Overview

Dataset statistics

Number of variables: 42
Number of observations: 20000
Missing cells: 32435
Missing cells (%): 3.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.4 MiB
Average record size in memory: 336.0 B
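
The missing-cell percentage can be checked from the counts in this table: it is the number of missing cells divided by the total number of cells (variables × observations). A minimal Python sketch, using only the figures reported above (the variable names are illustrative):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 42
n_observations = 20000
missing_cells = 32435

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
```

The result rounds to the 3.9% shown in the report.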

Variable types

Numeric: 17
Categorical: 21
Boolean: 4

Warnings

grau_instrucao has constant value "0" Constant
possui_telefone_celular has constant value "False" Constant
codigo_area_telefone_residencial has a high cardinality: 81 distinct values High cardinality
codigo_area_telefone_trabalho has a high cardinality: 77 distinct values High cardinality
qtde_contas_bancarias is highly correlated with qtde_contas_bancarias_especiais High correlation
qtde_contas_bancarias_especiais is highly correlated with qtde_contas_bancarias High correlation
local_onde_reside is highly correlated with local_onde_trabalha High correlation
local_onde_trabalha is highly correlated with local_onde_reside High correlation
possui_cartao_amex is highly correlated with possui_telefone_celular and 1 other field High correlation
nacionalidade is highly correlated with possui_telefone_celular and 1 other field High correlation
possui_telefone_celular is highly correlated with possui_cartao_amex and 23 other fields High correlation
possui_outros_cartoes is highly correlated with possui_telefone_celular and 1 other field High correlation
codigo_area_telefone_residencial is highly correlated with possui_telefone_celular and 2 other fields High correlation
qtde_contas_bancarias is highly correlated with possui_telefone_celular and 2 other fields High correlation
forma_envio_solicitacao is highly correlated with possui_telefone_celular and 1 other field High correlation
possui_cartao_mastercard is highly correlated with possui_telefone_celular and 1 other field High correlation
possui_cartao_visa is highly correlated with possui_telefone_celular and 1 other field High correlation
sexo is highly correlated with possui_telefone_celular and 1 other field High correlation
tipo_endereco is highly correlated with possui_telefone_celular and 1 other field High correlation
estado_onde_reside is highly correlated with possui_telefone_celular and 1 other field High correlation
possui_email is highly correlated with possui_telefone_celular and 1 other field High correlation
vinculo_formal_com_empresa is highly correlated with possui_telefone_celular and 1 other field High correlation
inadimplente is highly correlated with possui_telefone_celular and 1 other field High correlation
possui_carro is highly correlated with possui_telefone_celular and 1 other field High correlation
possui_cartao_diners is highly correlated with possui_telefone_celular and 1 other field High correlation
estado_onde_nasceu is highly correlated with possui_telefone_celular and 1 other field High correlation
produto_solicitado is highly correlated with possui_telefone_celular and 1 other field High correlation
qtde_contas_bancarias_especiais is highly correlated with possui_telefone_celular and 2 other fields High correlation
grau_instrucao is highly correlated with possui_cartao_amex and 23 other fields High correlation
possui_telefone_residencial is highly correlated with possui_telefone_celular and 2 other fields High correlation
codigo_area_telefone_trabalho is highly correlated with possui_telefone_celular and 2 other fields High correlation
possui_telefone_trabalho is highly correlated with possui_telefone_celular and 2 other fields High correlation
estado_onde_trabalha is highly correlated with possui_telefone_celular and 1 other field High correlation
tipo_residencia has 536 (2.7%) missing values Missing
meses_na_residencia has 1450 (7.2%) missing values Missing
profissao has 3097 (15.5%) missing values Missing
ocupacao has 2978 (14.9%) missing values Missing
profissao_companheiro has 11514 (57.6%) missing values Missing
grau_instrucao_companheiro has 12860 (64.3%) missing values Missing
renda_mensal_regular is highly skewed (γ1 = 67.75421325) Skewed
renda_extra is highly skewed (γ1 = 137.4095781) Skewed
valor_patrimonio_pessoal is highly skewed (γ1 = 126.6995194) Skewed
meses_no_trabalho is highly skewed (γ1 = 63.19895877) Skewed
id_solicitante is uniformly distributed Uniform
inadimplente is uniformly distributed Uniform
id_solicitante has unique values Unique
qtde_dependentes has 13350 (66.8%) zeros Zeros
tipo_residencia has 331 (1.7%) zeros Zeros
meses_na_residencia has 1858 (9.3%) zeros Zeros
renda_extra has 18930 (94.7%) zeros Zeros
valor_patrimonio_pessoal has 19072 (95.4%) zeros Zeros
meses_no_trabalho has 19973 (99.9%) zeros Zeros
profissao has 1398 (7.0%) zeros Zeros
ocupacao has 1114 (5.6%) zeros Zeros
profissao_companheiro has 5551 (27.8%) zeros Zeros
grau_instrucao_companheiro has 6485 (32.4%) zeros Zeros
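
The Constant, Unique, Missing and Zeros warnings above are simple per-column checks. The following is a rough Python sketch of how such flags could be derived. It is not pandas-profiling's implementation: the function name `flag_column` is hypothetical, and the sketch flags any occurrence, whereas the real tool applies its own configurable thresholds.

```python
def flag_column(values):
    """Return warning tags for one column; None marks a missing cell.

    Hypothetical helper for illustration only.
    """
    tags = []
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    distinct = set(present)

    # Constant: a single distinct non-missing value.
    if len(distinct) == 1:
        tags.append("Constant")
    # Unique: every non-missing value occurs exactly once.
    if len(present) > 1 and len(distinct) == len(present):
        tags.append("Unique")
    if n_missing:
        tags.append(f"Missing ({100 * n_missing / len(values):.1f}%)")
    # Count numeric zeros only; booleans and the string "0" are excluded.
    n_zeros = sum(1 for v in present if not isinstance(v, bool) and v == 0)
    if n_zeros:
        tags.append(f"Zeros ({100 * n_zeros / len(values):.1f}%)")
    return tags


print(flag_column(["0"] * 5))          # a constant categorical column
print(flag_column([0, 0, 3, None]))    # zeros and one missing cell
print(flag_column([1, 2, 3, 4]))       # an identifier-like column
```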

Reproduction

Analysis started: 2021-05-15 21:58:43.272122
Analysis finished: 2021-05-15 21:59:42.589101
Duration: 59.32 seconds
Software version: pandas-profiling v2.11.0
Download configuration: config.yaml

Variables

id_solicitante
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct: 20000
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10000.5
Minimum: 1
Maximum: 20000
Zeros: 0
Zeros (%): 0.0%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1000.95
Q1: 5000.75
Median: 10000.5
Q3: 15000.25
95-th percentile: 19000.05
Maximum: 20000
Range: 19999
Interquartile range (IQR): 9999.5

Descriptive statistics

Standard deviation: 5773.647028
Coefficient of variation (CV): 0.577335836
Kurtosis: -1.2
Mean: 10000.5
Median Absolute Deviation (MAD): 5000
Skewness: 0
Sum: 200010000
Variance: 33335000
Monotonicity: Strictly increasing
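
These figures are what one would expect from a plain row counter. As a sanity check, the sketch below recomputes several of them with Python's standard library, under the assumption that id_solicitante holds exactly the integers 1 to 20000 (consistent with the reported minimum, maximum, distinct count and strictly increasing order, but not confirmed by the report itself).

```python
import math
import statistics

# Assumption: the identifier is simply 1, 2, ..., 20000.
ids = range(1, 20001)

mean = statistics.mean(ids)
variance = statistics.variance(ids)   # sample variance (n - 1 denominator)
std = math.sqrt(variance)
cv = std / mean                       # coefficient of variation
median = statistics.median(ids)
# Median absolute deviation around the median.
mad = statistics.median(abs(x - median) for x in ids)

print(mean, variance, round(std, 6), round(cv, 9), mad)
```

Under that assumption the mean, variance, standard deviation, CV and MAD agree with the values listed above.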
Histogram with fixed size bins (bins=50)

Common values
Value                  Count   Frequency (%)
2047                   1       < 0.1%
10912                  1       < 0.1%
12947                  1       < 0.1%
2708                   1       < 0.1%
661                    1       < 0.1%
6806                   1       < 0.1%
4759                   1       < 0.1%
19100                  1       < 0.1%
17053                  1       < 0.1%
8865                   1       < 0.1%
Other values (19990)   19990   > 99.9%

Minimum 5 values
Value   Count   Frequency (%)
1       1       < 0.1%
2       1       < 0.1%
3       1       < 0.1%
4       1       < 0.1%
5       1       < 0.1%

Maximum 5 values
Value   Count   Frequency (%)
20000   1       < 0.1%
19999   1       < 0.1%
19998   1       < 0.1%
19997   1       < 0.1%
19996   1       < 0.1%

produto_solicitado
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
1       17023
2       2435
7       542

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 3
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 7
ValueCountFrequency (%)
117023
85.1%
22435
 
12.2%
7542
 
2.7%
Histogram of lengths of the category
ValueCountFrequency (%)
117023
85.1%
22435
 
12.2%
7542
 
2.7%

Most occurring characters

ValueCountFrequency (%)
117023
85.1%
22435
 
12.2%
7542
 
2.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number20000
100.0%

Most frequent character per category

ValueCountFrequency (%)
117023
85.1%
22435
 
12.2%
7542
 
2.7%

Most occurring scripts

ValueCountFrequency (%)
Common20000
100.0%

Most frequent character per script

ValueCountFrequency (%)
117023
85.1%
22435
 
12.2%
7542
 
2.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
117023
85.1%
22435
 
12.2%
7542
 
2.7%

dia_vencimento
Real number (ℝ≥0)

Distinct: 6
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.14725
Minimum: 1
Maximum: 25
Zeros: 0
Zeros (%): 0.0%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 1
5-th percentile: 5
Q1: 10
Median: 10
Q3: 20
95-th percentile: 25
Maximum: 25
Range: 24
Interquartile range (IQR): 10

Descriptive statistics

Standard deviation: 6.748506839
Coefficient of variation (CV): 0.5133017809
Kurtosis: -0.7233846608
Mean: 13.14725
Median Absolute Deviation (MAD): 5
Skewness: 0.441538168
Sum: 262945
Variance: 45.54234455
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)

Common values
Value   Count   Frequency (%)
10      7847    39.2%
15      3557    17.8%
25      3089    15.4%
5       2825    14.1%
20      1952    9.8%
1       730     3.6%

Minimum 5 values
Value   Count   Frequency (%)
1       730     3.6%
5       2825    14.1%
10      7847    39.2%
15      3557    17.8%
20      1952    9.8%

Maximum 5 values
Value   Count   Frequency (%)
25      3089    15.4%
20      1952    9.8%
15      3557    17.8%
10      7847    39.2%
5       2825    14.1%

forma_envio_solicitacao
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value        Count
internet     11264
presencial   7855
correio      881

Length

Max length: 10
Median length: 8
Mean length: 8.74145
Min length: 7

Characters and Unicode

Total characters: 174829
Distinct characters: 11
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: presencial
2nd row: internet
3rd row: internet
4th row: internet
5th row: internet
ValueCountFrequency (%)
internet11264
56.3%
presencial7855
39.3%
correio881
 
4.4%
Histogram of lengths of the category
ValueCountFrequency (%)
internet11264
56.3%
presencial7855
39.3%
correio881
 
4.4%

Most occurring characters

ValueCountFrequency (%)
e39119
22.4%
n30383
17.4%
t22528
12.9%
r20881
11.9%
i20000
11.4%
c8736
 
5.0%
p7855
 
4.5%
s7855
 
4.5%
a7855
 
4.5%
l7855
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter174829
100.0%

Most frequent character per category

ValueCountFrequency (%)
e39119
22.4%
n30383
17.4%
t22528
12.9%
r20881
11.9%
i20000
11.4%
c8736
 
5.0%
p7855
 
4.5%
s7855
 
4.5%
a7855
 
4.5%
l7855
 
4.5%

Most occurring scripts

ValueCountFrequency (%)
Latin174829
100.0%

Most frequent character per script

ValueCountFrequency (%)
e39119
22.4%
n30383
17.4%
t22528
12.9%
r20881
11.9%
i20000
11.4%
c8736
 
5.0%
p7855
 
4.5%
s7855
 
4.5%
a7855
 
4.5%
l7855
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII174829
100.0%

Most frequent character per block

ValueCountFrequency (%)
e39119
22.4%
n30383
17.4%
t22528
12.9%
r20881
11.9%
i20000
11.4%
c8736
 
5.0%
p7855
 
4.5%
s7855
 
4.5%
a7855
 
4.5%
l7855
 
4.5%

tipo_endereco
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
1       19873
2       127

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1
ValueCountFrequency (%)
119873
99.4%
2127
 
0.6%
Histogram of lengths of the category
ValueCountFrequency (%)
119873
99.4%
2127
 
0.6%

Most occurring characters

ValueCountFrequency (%)
119873
99.4%
2127
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number20000
100.0%

Most frequent character per category

ValueCountFrequency (%)
119873
99.4%
2127
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Common20000
100.0%

Most frequent character per script

ValueCountFrequency (%)
119873
99.4%
2127
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
119873
99.4%
2127
 
0.6%

sexo
Categorical

HIGH CORRELATION

Distinct: 4
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value         Count
F             12246
M             7722
N             25
" " (space)   7

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 4
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: M
2nd row: F
3rd row: F
4th row: M
5th row: F
ValueCountFrequency (%)
F12246
61.2%
M7722
38.6%
N25
 
0.1%
7
 
< 0.1%
Histogram of lengths of the category
ValueCountFrequency (%)
f12246
61.3%
m7722
38.6%
n25
 
0.1%

Most occurring characters

ValueCountFrequency (%)
F12246
61.2%
M7722
38.6%
N25
 
0.1%
7
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter19993
> 99.9%
Space Separator7
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
F12246
61.3%
M7722
38.6%
N25
 
0.1%
ValueCountFrequency (%)
7
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin19993
> 99.9%
Common7
 
< 0.1%

Most frequent character per script

ValueCountFrequency (%)
F12246
61.3%
M7722
38.6%
N25
 
0.1%
ValueCountFrequency (%)
7
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
F12246
61.2%
M7722
38.6%
N25
 
0.1%
7
 
< 0.1%
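
The "Space Separator" rows above show that 7 records carry a single space instead of a code, yet the report counts 0 missing values because a space is still a non-null string. A minimal Python sketch of one way to normalise such codes before analysis follows; the helper name `normalize_code` and the toy sample are illustrative assumptions, not part of the profiled pipeline, and the meaning of the "N" code is left untouched because the report does not define it.

```python
from collections import Counter


def normalize_code(value):
    """Map empty or whitespace-only codes to None; strip other values."""
    if value is None:
        return None
    stripped = value.strip()
    return stripped if stripped else None


# Toy sample mirroring the categories reported for sexo (not the real data).
sample = ["M", "F", "F", " ", "N", "F"]
cleaned = [normalize_code(v) for v in sample]
counts = Counter(cleaned)

print(cleaned)
print(counts)
```

After this step the blank entries would surface as missing values in a subsequent profile rather than as a fourth category.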

idade
Real number (ℝ≥0)

Distinct: 84
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 42.3525
Minimum: 7
Maximum: 106
Zeros: 0
Zeros (%): 0.0%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 7
5-th percentile: 21
Q1: 31
Median: 40
Q3: 52
95-th percentile: 70
Maximum: 106
Range: 99
Interquartile range (IQR): 21

Descriptive statistics

Standard deviation: 14.93017713
Coefficient of variation (CV): 0.3525217433
Kurtosis: -0.210705359
Mean: 42.3525
Median Absolute Deviation (MAD): 10
Skewness: 0.5584304521
Sum: 847050
Variance: 222.9101893
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value               Count   Frequency (%)
40                  555     2.8%
39                  534     2.7%
36                  526     2.6%
32                  518     2.6%
37                  513     2.6%
43                  510     2.5%
28                  509     2.5%
33                  504     2.5%
31                  503     2.5%
38                  500     2.5%
Other values (74)   14828   74.1%

Minimum 5 values
Value   Count   Frequency (%)
7       1       < 0.1%
17      7       < 0.1%
18      265     1.3%
19      260     1.3%
20      293     1.5%

Maximum 5 values
Value   Count   Frequency (%)
106     2       < 0.1%
100     1       < 0.1%
97      1       < 0.1%
96      2       < 0.1%
95      4       < 0.1%

estado_civil
Real number (ℝ≥0)

Distinct: 8
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.12085
Minimum: 0
Maximum: 7
Zeros: 81
Zeros (%): 0.4%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 1
Median: 2
Q3: 2
95-th percentile: 5
Maximum: 7
Range: 7
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.33200375
Coefficient of variation (CV): 0.6280518423
Kurtosis: 2.799170933
Mean: 2.12085
Median Absolute Deviation (MAD): 0
Skewness: 1.76004596
Sum: 42417
Variance: 1.774233989
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)

Common values
Value   Count   Frequency (%)
2       10088   50.4%
1       6519    32.6%
4       1573    7.9%
6       763     3.8%
5       522     2.6%
3       234     1.2%
7       220     1.1%
0       81      0.4%

Minimum 5 values
Value   Count   Frequency (%)
0       81      0.4%
1       6519    32.6%
2       10088   50.4%
3       234     1.2%
4       1573    7.9%

Maximum 5 values
Value   Count   Frequency (%)
7       220     1.1%
6       763     3.8%
5       522     2.6%
4       1573    7.9%
3       234     1.2%

qtde_dependentes
Real number (ℝ≥0)

ZEROS

Distinct: 15
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.6664
Minimum: 0
Maximum: 53
Zeros: 13350
Zeros (%): 66.8%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 1
95-th percentile: 3
Maximum: 53
Range: 53
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.23672451
Coefficient of variation (CV): 1.855829097
Kurtosis: 167.6045062
Mean: 0.6664
Median Absolute Deviation (MAD): 0
Skewness: 5.925042325
Sum: 13328
Variance: 1.529487514
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=15)

Common values
Value              Count   Frequency (%)
0                  13350   66.8%
1                  2814    14.1%
2                  2189    10.9%
3                  1029    5.1%
4                  352     1.8%
5                  149     0.7%
6                  57      0.3%
7                  22      0.1%
8                  14      0.1%
9                  9       < 0.1%
Other values (5)   15      0.1%

Minimum 5 values
Value   Count   Frequency (%)
0       13350   66.8%
1       2814    14.1%
2       2189    10.9%
3       1029    5.1%
4       352     1.8%

Maximum 5 values
Value   Count   Frequency (%)
53      1       < 0.1%
14      1       < 0.1%
13      2       < 0.1%
11      4       < 0.1%
10      7       < 0.1%

grau_instrucao
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
0       20000

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 1
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0
ValueCountFrequency (%)
020000
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
020000
100.0%

Most occurring characters

ValueCountFrequency (%)
020000
100.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number20000
100.0%

Most frequent character per category

ValueCountFrequency (%)
020000
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common20000
100.0%

Most frequent character per script

ValueCountFrequency (%)
020000
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
020000
100.0%

nacionalidade
Categorical

HIGH CORRELATION

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
1       19152
0       808
2       40

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 3
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1
ValueCountFrequency (%)
119152
95.8%
0808
 
4.0%
240
 
0.2%
Histogram of lengths of the category
ValueCountFrequency (%)
119152
95.8%
0808
 
4.0%
240
 
0.2%

Most occurring characters

ValueCountFrequency (%)
119152
95.8%
0808
 
4.0%
240
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number20000
100.0%

Most frequent character per category

ValueCountFrequency (%)
119152
95.8%
0808
 
4.0%
240
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Common20000
100.0%

Most frequent character per script

ValueCountFrequency (%)
119152
95.8%
0808
 
4.0%
240
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
119152
95.8%
0808
 
4.0%
240
 
0.2%

estado_onde_nasceu
Categorical

HIGH CORRELATION

Distinct: 28
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value               Count
BA                  2351
SP                  2336
RS                  1919
CE                  1910
PE                  1651
Other values (23)   9833

Length

Max length: 2
Median length: 2
Mean length: 1.9589
Min length: 1

Characters and Unicode

Total characters: 39178
Distinct characters: 18
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: CE
2nd row: SE
3rd row: BA
4th row: RS
5th row: BA
ValueCountFrequency (%)
BA2351
11.8%
SP2336
11.7%
RS1919
 
9.6%
CE1910
 
9.6%
PE1651
 
8.3%
MG1446
 
7.2%
RN827
 
4.1%
822
 
4.1%
PR764
 
3.8%
RJ720
 
3.6%
Other values (18)5254
26.3%
Histogram of lengths of the category
ValueCountFrequency (%)
ba2351
12.3%
sp2336
12.2%
rs1919
10.0%
ce1910
10.0%
pe1651
 
8.6%
mg1446
 
7.5%
rn827
 
4.3%
pr764
 
4.0%
rj720
 
3.8%
al678
 
3.5%
Other values (17)4576
23.9%

Most occurring characters

ValueCountFrequency (%)
P6421
16.4%
S5129
13.1%
A4723
12.1%
R4313
11.0%
E3965
10.1%
B2959
7.6%
M2744
7.0%
C2373
 
6.1%
G1906
 
4.9%
N827
 
2.1%
Other values (8)3818
9.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter38356
97.9%
Space Separator822
 
2.1%

Most frequent character per category

ValueCountFrequency (%)
P6421
16.7%
S5129
13.4%
A4723
12.3%
R4313
11.2%
E3965
10.3%
B2959
7.7%
M2744
7.2%
C2373
 
6.2%
G1906
 
5.0%
N827
 
2.2%
Other values (7)2996
7.8%
ValueCountFrequency (%)
822
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin38356
97.9%
Common822
 
2.1%

Most frequent character per script

ValueCountFrequency (%)
P6421
16.7%
S5129
13.4%
A4723
12.3%
R4313
11.2%
E3965
10.3%
B2959
7.7%
M2744
7.2%
C2373
 
6.2%
G1906
 
5.0%
N827
 
2.2%
Other values (7)2996
7.8%
ValueCountFrequency (%)
822
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII39178
100.0%

Most frequent character per block

ValueCountFrequency (%)
P6421
16.4%
S5129
13.1%
A4723
12.1%
R4313
11.0%
E3965
10.1%
B2959
7.6%
M2744
7.0%
C2373
 
6.1%
G1906
 
4.9%
N827
 
2.1%
Other values (8)3818
9.7%

estado_onde_reside
Categorical

HIGH CORRELATION

Distinct: 27
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value               Count
SP                  3578
BA                  2045
RS                  1995
CE                  1865
PE                  1484
Other values (22)   9033

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Characters and Unicode

Total characters: 40000
Distinct characters: 17
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: CE
2nd row: SE
3rd row: BA
4th row: RS
5th row: BA
ValueCountFrequency (%)
SP3578
17.9%
BA2045
10.2%
RS1995
10.0%
CE1865
9.3%
PE1484
 
7.4%
MG1187
 
5.9%
PA927
 
4.6%
RJ863
 
4.3%
RN846
 
4.2%
GO682
 
3.4%
Other values (17)4528
22.6%
Histogram of lengths of the category
ValueCountFrequency (%)
sp3578
17.9%
ba2045
10.2%
rs1995
10.0%
ce1865
9.3%
pe1484
 
7.4%
mg1187
 
5.9%
pa927
 
4.6%
rj863
 
4.3%
rn846
 
4.2%
go682
 
3.4%
Other values (17)4528
22.6%

Most occurring characters

ValueCountFrequency (%)
P7453
18.6%
S6485
16.2%
R4489
11.2%
A4307
10.8%
E3741
9.4%
B2544
 
6.4%
M2450
 
6.1%
C2204
 
5.5%
G1869
 
4.7%
J863
 
2.2%
Other values (7)3595
9.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter40000
100.0%

Most frequent character per category

ValueCountFrequency (%)
P7453
18.6%
S6485
16.2%
R4489
11.2%
A4307
10.8%
E3741
9.4%
B2544
 
6.4%
M2450
 
6.1%
C2204
 
5.5%
G1869
 
4.7%
J863
 
2.2%
Other values (7)3595
9.0%

Most occurring scripts

ValueCountFrequency (%)
Latin40000
100.0%

Most frequent character per script

ValueCountFrequency (%)
P7453
18.6%
S6485
16.2%
R4489
11.2%
A4307
10.8%
E3741
9.4%
B2544
 
6.4%
M2450
 
6.1%
C2204
 
5.5%
G1869
 
4.7%
J863
 
2.2%
Other values (7)3595
9.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII40000
100.0%

Most frequent character per block

ValueCountFrequency (%)
P7453
18.6%
S6485
16.2%
R4489
11.2%
A4307
10.8%
E3741
9.4%
B2544
 
6.4%
M2450
 
6.1%
C2204
 
5.5%
G1869
 
4.7%
J863
 
2.2%
Other values (7)3595
9.0%

possui_telefone_residencial
Boolean

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 19.7 KiB

Value   Count   Frequency (%)
True    16474   82.4%
False   3526    17.6%

codigo_area_telefone_residencial
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct: 81
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value               Count
" " (space)         3534
5                   1838
97                  1142
107                 1142
54                  904
Other values (76)   11440

Length

Max length: 3
Median length: 2
Mean length: 1.9452
Min length: 1

Characters and Unicode

Total characters: 38904
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 11
Unique (%): 0.1%

Sample

1st row: 107
2nd row: 91
3rd row: 90
4th row: 54
5th row: 86
ValueCountFrequency (%)
3534
17.7%
51838
 
9.2%
971142
 
5.7%
1071142
 
5.7%
54904
 
4.5%
105646
 
3.2%
84545
 
2.7%
81535
 
2.7%
20534
 
2.7%
58518
 
2.6%
Other values (71)8662
43.3%
Histogram of lengths of the category
ValueCountFrequency (%)
51838
 
11.2%
1071142
 
6.9%
971142
 
6.9%
54904
 
5.5%
105646
 
3.9%
84545
 
3.3%
81535
 
3.2%
20534
 
3.2%
58518
 
3.1%
100494
 
3.0%
Other values (70)8168
49.6%

Most occurring characters

ValueCountFrequency (%)
18002
20.6%
04814
12.4%
54770
12.3%
73969
10.2%
3534
9.1%
23315
8.5%
42423
 
6.2%
82418
 
6.2%
92145
 
5.5%
62036
 
5.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number35370
90.9%
Space Separator3534
 
9.1%

Most frequent character per category

ValueCountFrequency (%)
18002
22.6%
04814
13.6%
54770
13.5%
73969
11.2%
23315
9.4%
42423
 
6.9%
82418
 
6.8%
92145
 
6.1%
62036
 
5.8%
31478
 
4.2%
ValueCountFrequency (%)
3534
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common38904
100.0%

Most frequent character per script

ValueCountFrequency (%)
18002
20.6%
04814
12.4%
54770
12.3%
73969
10.2%
3534
9.1%
23315
8.5%
42423
 
6.2%
82418
 
6.2%
92145
 
5.5%
62036
 
5.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII38904
100.0%

Most frequent character per block

ValueCountFrequency (%)
18002
20.6%
04814
12.4%
54770
12.3%
73969
10.2%
3534
9.1%
23315
8.5%
42423
 
6.2%
82418
 
6.2%
92145
 
5.5%
62036
 
5.2%

tipo_residencia
Real number (ℝ≥0)

MISSING
ZEROS

Distinct: 6
Distinct (%): < 0.1%
Missing: 536
Missing (%): 2.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.261302918
Minimum: 0
Maximum: 5
Zeros: 331
Zeros (%): 1.7%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 1
Median: 1
Q3: 1
95-th percentile: 3
Maximum: 5
Range: 5
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.8835795418
Coefficient of variation (CV): 0.7005292139
Kurtosis: 11.37714604
Mean: 1.261302918
Median Absolute Deviation (MAD): 0
Skewness: 3.408604224
Sum: 24550
Variance: 0.7807128068
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)

Common values
Value       Count   Frequency (%)
1           16497   82.5%
2           1635    8.2%
5           827     4.1%
0           331     1.7%
4           126     0.6%
3           48      0.2%
(Missing)   536     2.7%

Minimum 5 values
Value   Count   Frequency (%)
0       331     1.7%
1       16497   82.5%
2       1635    8.2%
3       48      0.2%
4       126     0.6%

Maximum 5 values
Value   Count   Frequency (%)
5       827     4.1%
4       126     0.6%
3       48      0.2%
2       1635    8.2%
1       16497   82.5%

meses_na_residencia
Real number (ℝ≥0)

MISSING
ZEROS

Distinct: 76
Distinct (%): 0.4%
Missing: 1450
Missing (%): 7.2%
Infinite: 0
Infinite (%): 0.0%
Mean: 9.57245283
Minimum: 0
Maximum: 228
Zeros: 1858
Zeros (%): 9.3%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
Median: 6
Q3: 15
95-th percentile: 30
Maximum: 228
Range: 228
Interquartile range (IQR): 14

Descriptive statistics

Standard deviation: 10.64958027
Coefficient of variation (CV): 1.112523661
Kurtosis: 18.11111445
Mean: 9.57245283
Median Absolute Deviation (MAD): 5
Skewness: 2.340849526
Sum: 177569
Variance: 113.4135599
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value               Count   Frequency (%)
1                   2937    14.7%
0                   1858    9.3%
10                  1510    7.5%
5                   1486    7.4%
2                   1319    6.6%
3                   953     4.8%
20                  934     4.7%
15                  776     3.9%
8                   672     3.4%
6                   666     3.3%
Other values (66)   5439    27.2%
(Missing)           1450    7.2%

Minimum 5 values
Value   Count   Frequency (%)
0       1858    9.3%
1       2937    14.7%
2       1319    6.6%
3       953     4.8%
4       643     3.2%

Maximum 5 values
Value   Count   Frequency (%)
228     1       < 0.1%
200     1       < 0.1%
100     1       < 0.1%
96      1       < 0.1%
89      1       < 0.1%

possui_telefone_celular
Boolean

CONSTANT
HIGH CORRELATION
REJECTED

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 19.7 KiB

Value   Count   Frequency (%)
False   20000   100.0%

possui_email
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
1       15984
0       4016

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1
ValueCountFrequency (%)
115984
79.9%
04016
 
20.1%
Histogram of lengths of the category
ValueCountFrequency (%)
115984
79.9%
04016
 
20.1%

Most occurring characters

ValueCountFrequency (%)
115984
79.9%
04016
 
20.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number20000
100.0%

Most frequent character per category

ValueCountFrequency (%)
115984
79.9%
04016
 
20.1%

Most occurring scripts

ValueCountFrequency (%)
Common20000
100.0%

Most frequent character per script

ValueCountFrequency (%)
115984
79.9%
04016
 
20.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
115984
79.9%
04016
 
20.1%

renda_mensal_regular
Real number (ℝ≥0)

SKEWED

Distinct: 3031
Distinct (%): 15.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 957.1309375
Minimum: 69
Maximum: 959000
Zeros: 0
Zeros (%): 0.0%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 69
5-th percentile: 289
Q1: 360
Median: 500
Q3: 800
95-th percentile: 1782.05
Maximum: 959000
Range: 958931
Interquartile range (IQR): 440

Descriptive statistics

Standard deviation: 11353.965
Coefficient of variation (CV): 11.86249922
Kurtosis: 5062.489381
Mean: 957.1309375
Median Absolute Deviation (MAD): 150
Skewness: 67.75421325
Sum: 19142618.75
Variance: 128912521.2
Monotonicity: Not monotonic
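
The mean (957.13) sits well above the median (500) and the skewness is very large, which is the pattern a handful of extreme incomes would produce. The sketch below shows how a skewness figure of this kind can be computed in plain Python. It assumes the report's γ1 is the bias-adjusted sample skewness (the estimator used by pandas' `Series.skew`); that is an assumption about the tool, and the `incomes` list is a toy example, not the profiled data.

```python
import math


def sample_skewness(values):
    """Bias-adjusted Fisher-Pearson sample skewness (G1)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    # Second and third central moments (population form).
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    if m2 == 0:
        return 0.0
    g1 = m3 / m2 ** 1.5
    # Small-sample adjustment.
    return math.sqrt(n * (n - 1)) / (n - 2) * g1


# Toy values: typical incomes plus one extreme outlier.
incomes = [350, 360, 400, 500, 800, 959000]

print(sample_skewness([1, 2, 3, 4, 5]))  # symmetric data
print(sample_skewness(incomes))          # right-skewed data
```

A symmetric sample yields zero, while a single extreme value is enough to push the statistic strongly positive.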
Histogram with fixed size bins (bins=50)

Common values
Value                 Count   Frequency (%)
350                   2808    14.0%
500                   628     3.1%
400                   579     2.9%
380                   546     2.7%
600                   513     2.6%
700                   419     2.1%
800                   388     1.9%
450                   340     1.7%
300                   337     1.7%
1000                  248     1.2%
Other values (3021)   13194   66.0%

Minimum 5 values
Value   Count   Frequency (%)
69      1       < 0.1%
100     5       < 0.1%
105     1       < 0.1%
115     1       < 0.1%
120     5       < 0.1%

Maximum 5 values
Value    Count   Frequency (%)
959000   1       < 0.1%
875000   1       < 0.1%
668000   1       < 0.1%
486778   1       < 0.1%
174274   1       < 0.1%

renda_extra
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct: 284
Distinct (%): 1.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 39.0969585
Minimum: 0
Maximum: 194344
Zeros: 18930
Zeros (%): 94.7%
Memory size: 156.4 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 100
Maximum: 194344
Range: 194344
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1387.42878
Coefficient of variation (CV): 35.48687247
Kurtosis: 19237.66506
Mean: 39.0969585
Median Absolute Deviation (MAD): 0
Skewness: 137.4095781
Sum: 781939.17
Variance: 1924958.62
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values
Value                Count   Frequency (%)
0                    18930   94.7%
350                  136     0.7%
600                  61      0.3%
300                  58      0.3%
400                  57      0.3%
200                  57      0.3%
500                  53      0.3%
800                  31      0.2%
250                  29      0.1%
150                  25      0.1%
Other values (274)   563     2.8%

Minimum 5 values
Value   Count   Frequency (%)
0       18930   94.7%
1       1       < 0.1%
3       1       < 0.1%
15      2       < 0.1%
31.48   1       < 0.1%

Maximum 5 values
Value    Count   Frequency (%)
194344   1       < 0.1%
10200    1       < 0.1%
8341     1       < 0.1%
5400     1       < 0.1%
5000     1       < 0.1%

possui_cartao_visa
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
0       17822
1       2178

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0
ValueCountFrequency (%)
017822
89.1%
12178
 
10.9%
Histogram of lengths of the category
ValueCountFrequency (%)
017822
89.1%
12178
 
10.9%

Most occurring characters

ValueCountFrequency (%)
017822
89.1%
12178
 
10.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number20000
100.0%

Most frequent character per category

ValueCountFrequency (%)
017822
89.1%
12178
 
10.9%

Most occurring scripts

ValueCountFrequency (%)
Common20000
100.0%

Most frequent character per script

ValueCountFrequency (%)
017822
89.1%
12178
 
10.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII20000
100.0%

Most frequent character per block

ValueCountFrequency (%)
017822
89.1%
12178
 
10.9%

possui_cartao_mastercard
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.4 KiB

Value   Count
0       18101
1       1899

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Characters and Unicode

Total characters: 20000
Distinct characters: 2
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0
Value  Count  Frequency (%)
0      18101  90.5%
1      1899   9.5%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      18101  90.5%
1      1899   9.5%

Most occurring characters

Value  Count  Frequency (%)
0      18101  90.5%
1      1899   9.5%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      18101  90.5%
1      1899   9.5%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      18101  90.5%
1      1899   9.5%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      18101  90.5%
1      1899   9.5%

possui_cartao_diners
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
19968 
1
 
32

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
Value  Count  Frequency (%)
0      19968  99.8%
1      32     0.2%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      19968  99.8%
1      32     0.2%

Most occurring characters

Value  Count  Frequency (%)
0      19968  99.8%
1      32     0.2%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      19968  99.8%
1      32     0.2%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      19968  99.8%
1      32     0.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      19968  99.8%
1      32     0.2%

possui_cartao_amex
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
19959 
1
 
41

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
Value  Count  Frequency (%)
0      19959  99.8%
1      41     0.2%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      19959  99.8%
1      41     0.2%

Most occurring characters

Value  Count  Frequency (%)
0      19959  99.8%
1      41     0.2%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      19959  99.8%
1      41     0.2%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      19959  99.8%
1      41     0.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      19959  99.8%
1      41     0.2%

possui_outros_cartoes
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
19955 
1
 
45

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0
Value  Count  Frequency (%)
0      19955  99.8%
1      45     0.2%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      19955  99.8%
1      45     0.2%

Most occurring characters

Value  Count  Frequency (%)
0      19955  99.8%
1      45     0.2%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      19955  99.8%
1      45     0.2%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      19955  99.8%
1      45     0.2%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      19955  99.8%
1      45     0.2%

qtde_contas_bancarias
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
12786 
1
7206 
2
 
8

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row0
Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

qtde_contas_bancarias_especiais
Categorical

HIGH CORRELATION
HIGH CORRELATION

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
12786 
1
7206 
2
 
8

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row0
Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring characters

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      12786  63.9%
1      7206   36.0%
2      8      < 0.1%

valor_patrimonio_pessoal
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct94
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2095.614
Minimum0
Maximum6000000
Zeros19072
Zeros (%)95.4%
Memory size156.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum6000000
Range6000000
Interquartile range (IQR)0

Descriptive statistics

Standard deviation44033.43658
Coefficient of variation (CV)21.01218859
Kurtosis17218.03756
Mean2095.614
Median Absolute Deviation (MAD)0
Skewness126.6995194
Sum41912280
Variance1938943537
MonotonicityNot monotonic
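
The descriptive statistics above are tied together by simple identities: the mean is the sum divided by the number of observations, the variance is the squared standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below re-derives them from the rounded figures reported for valor_patrimonio_pessoal; the variable names are illustrative only.

```python
from math import isclose

# Figures copied from the report for valor_patrimonio_pessoal (rounded as shown there).
n_observations = 20000
total = 41912280
reported_mean = 2095.614
reported_std = 44033.43658

# Mean is the sum divided by the number of (non-missing) observations.
mean = total / n_observations

# Variance is the square of the standard deviation.
variance = reported_std ** 2

# Coefficient of variation (CV) is the standard deviation relative to the mean.
cv = reported_std / reported_mean

print(f"mean={mean:.3f} variance={variance:.0f} cv={cv:.5f}")
assert isclose(mean, reported_mean, rel_tol=1e-9)
```

A CV far above 1, as here, is consistent with the SKEWED and ZEROS warnings: most applicants report 0 while a few report very large values.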
Histogram with fixed size bins (bins=50)

Value              Count  Frequency (%)
0                  19072  95.4%
25000              87     0.4%
30000              86     0.4%
20000              83     0.4%
50000              71     0.4%
15000              66     0.3%
35000              63     0.3%
40000              48     0.2%
45000              39     0.2%
60000              37     0.2%
Other values (84)  348    1.7%

Value  Count  Frequency (%)
0      19072  95.4%
7      1      < 0.1%
15     1      < 0.1%
17     1      < 0.1%
18     2      < 0.1%

Value    Count  Frequency (%)
6000000  1      < 0.1%
600000   1      < 0.1%
450000   1      < 0.1%
320000   1      < 0.1%
250000   2      < 0.1%

possui_carro
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
13219 
1
6781 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row0
4th row0
5th row1
Value  Count  Frequency (%)
0      13219  66.1%
1      6781   33.9%

Histogram of lengths of the category

Value  Count  Frequency (%)
0      13219  66.1%
1      6781   33.9%

Most occurring characters

Value  Count  Frequency (%)
0      13219  66.1%
1      6781   33.9%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      13219  66.1%
1      6781   33.9%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      13219  66.1%
1      6781   33.9%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      13219  66.1%
1      6781   33.9%

vinculo_formal_com_empresa
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.7 KiB
False
11174 
True
8826 
Value  Count  Frequency (%)
False  11174  55.9%
True   8826   44.1%

estado_onde_trabalha
Categorical

HIGH CORRELATION

Distinct28
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
13573 
SP
 
1010
RS
 
819
CE
 
588
BA
 
569
Other values (23)
3441 

Length

Max length2
Median length1
Mean length1.32135
Min length1

Characters and Unicode

Total characters26427
Distinct characters18
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
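
As a rough illustration of how per-character Unicode properties can be tallied, the sketch below counts general categories with Python's standard `unicodedata` module. The sample values are hypothetical stand-ins for this column (state abbreviations and single-space blanks), and this is not necessarily how the report itself computes its figures. The standard library exposes the general category (`Lu`, `Zs`, `Nd`, ...) but not the script or block, so only categories are shown.

```python
import unicodedata
from collections import Counter

# Hypothetical sample of column values: state codes and blank (single-space) entries.
values = ["SP", " ", "RS", " ", "BA"]

# Count the Unicode general category of every character across all values.
category_counts = Counter(
    unicodedata.category(ch) for value in values for ch in value
)

# "Lu" is Uppercase Letter, "Zs" is Space Separator, "Nd" is Decimal Number.
for category, count in category_counts.most_common():
    print(category, count)
```

With real data, the same tally over every value of estado_onde_trabalha would yield the two categories listed under "Most occurring categories".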

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th rowRS
5th rowBA
Value              Count  Frequency (%)
(space)            13573  67.9%
SP                 1010   5.1%
RS                 819    4.1%
CE                 588    2.9%
BA                 569    2.8%
MG                 500    2.5%
PE                 369    1.8%
PA                 316    1.6%
PR                 236    1.2%
RJ                 229    1.1%
Other values (18)  1791   9.0%

Histogram of lengths of the category

Value              Count  Frequency (%)
sp                 1010   15.7%
rs                 819    12.7%
ce                 588    9.1%
ba                 569    8.9%
mg                 500    7.8%
pe                 369    5.7%
pa                 316    4.9%
pr                 236    3.7%
rj                 229    3.6%
mt                 224    3.5%
Other values (17)  1567   24.4%

Most occurring characters

Value             Count  Frequency (%)
(space)           13573  51.4%
S                 2204   8.3%
P                 2179   8.2%
R                 1569   5.9%
A                 1276   4.8%
E                 1068   4.0%
M                 1001   3.8%
C                 738    2.8%
G                 718    2.7%
B                 701    2.7%
Other values (8)  1400   5.3%

Most occurring categories

Value             Count  Frequency (%)
Space Separator   13573  51.4%
Uppercase Letter  12854  48.6%

Most frequent character per category

Value             Count  Frequency (%)
S                 2204   17.1%
P                 2179   17.0%
R                 1569   12.2%
A                 1276   9.9%
E                 1068   8.3%
M                 1001   7.8%
C                 738    5.7%
G                 718    5.6%
B                 701    5.5%
O                 297    2.3%
Other values (7)  1103   8.6%

Value    Count  Frequency (%)
(space)  13573  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  13573  51.4%
Latin   12854  48.6%

Most frequent character per script

Value             Count  Frequency (%)
S                 2204   17.1%
P                 2179   17.0%
R                 1569   12.2%
A                 1276   9.9%
E                 1068   8.3%
M                 1001   7.8%
C                 738    5.7%
G                 718    5.6%
B                 701    5.5%
O                 297    2.3%
Other values (7)  1103   8.6%

Value    Count  Frequency (%)
(space)  13573  100.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  26427  100.0%

Most frequent character per block

Value             Count  Frequency (%)
(space)           13573  51.4%
S                 2204   8.3%
P                 2179   8.2%
R                 1569   5.9%
A                 1276   4.8%
E                 1068   4.0%
M                 1001   3.8%
C                 738    2.8%
G                 718    2.7%
B                 701    2.7%
Other values (8)  1400   5.3%

possui_telefone_trabalho
Boolean

HIGH CORRELATION

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.7 KiB
False
14519 
True
5481 
Value  Count  Frequency (%)
False  14519  72.6%
True   5481   27.4%

codigo_area_telefone_trabalho
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct77
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
14525 
5
 
631
54
 
442
107
 
407
97
 
264
Other values (72)
3731 

Length

Max length3
Median length1
Mean length1.30375
Min length1

Characters and Unicode

Total characters26075
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)< 0.1%

Sample

1st row
2nd row
3rd row
4th row54
5th row
Value              Count  Frequency (%)
(space)            14525  72.6%
5                  631    3.2%
54                 442    2.2%
107                407    2.0%
97                 264    1.3%
81                 196    1.0%
29                 187    0.9%
66                 184    0.9%
105                182    0.9%
58                 178    0.9%
Other values (67)  2804   14.0%

Histogram of lengths of the category

Value              Count  Frequency (%)
5                  631    11.5%
54                 442    8.1%
107                407    7.4%
97                 264    4.8%
81                 196    3.6%
29                 187    3.4%
66                 184    3.4%
105                182    3.3%
58                 178    3.3%
20                 143    2.6%
Other values (66)  2661   48.6%

Most occurring characters

Value    Count  Frequency (%)
(space)  14525  55.7%
1        2296   8.8%
5        1823   7.0%
0        1395   5.3%
7        1385   5.3%
2        976    3.7%
4        895    3.4%
8        810    3.1%
6        783    3.0%
9        610    2.3%

Most occurring categories

Value            Count  Frequency (%)
Space Separator  14525  55.7%
Decimal Number   11550  44.3%

Most frequent character per category

Value  Count  Frequency (%)
1      2296   19.9%
5      1823   15.8%
0      1395   12.1%
7      1385   12.0%
2      976    8.5%
4      895    7.7%
8      810    7.0%
6      783    6.8%
9      610    5.3%
3      577    5.0%

Value    Count  Frequency (%)
(space)  14525  100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  26075  100.0%

Most frequent character per script

Value    Count  Frequency (%)
(space)  14525  55.7%
1        2296   8.8%
5        1823   7.0%
0        1395   5.3%
7        1385   5.3%
2        976    3.7%
4        895    3.4%
8        810    3.1%
6        783    3.0%
9        610    2.3%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  26075  100.0%

Most frequent character per block

Value    Count  Frequency (%)
(space)  14525  55.7%
1        2296   8.8%
5        1823   7.0%
0        1395   5.3%
7        1385   5.3%
2        976    3.7%
4        895    3.4%
8        810    3.1%
6        783    3.0%
9        610    2.3%

meses_no_trabalho
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0089
Minimum0
Maximum32
Zeros19973
Zeros (%)99.9%
Memory size156.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum32
Range32
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3888808962
Coefficient of variation (CV)43.69448272
Kurtosis4536.037419
Mean0.0089
Median Absolute Deviation (MAD)0
Skewness63.19895877
Sum178
Variance0.1512283514
MonotonicityNot monotonic
Histogram with fixed size bins (bins=13)

Value             Count  Frequency (%)
0                 19973  99.9%
1                 7      < 0.1%
3                 4      < 0.1%
2                 4      < 0.1%
6                 2      < 0.1%
5                 2      < 0.1%
4                 2      < 0.1%
15                1      < 0.1%
30                1      < 0.1%
14                1      < 0.1%
Other values (3)  3      < 0.1%

Value  Count  Frequency (%)
0      19973  99.9%
1      7      < 0.1%
2      4      < 0.1%
3      4      < 0.1%
4      2      < 0.1%

Value  Count  Frequency (%)
32     1      < 0.1%
30     1      < 0.1%
18     1      < 0.1%
15     1      < 0.1%
14     1      < 0.1%

profissao
Real number (ℝ≥0)

MISSING
ZEROS

Distinct18
Distinct (%)0.1%
Missing3097
Missing (%)15.5%
Infinite0
Infinite (%)0.0%
Mean8.045080755
Minimum0
Maximum17
Zeros1398
Zeros (%)7.0%
Memory size156.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q19
median9
Q39
95-th percentile11
Maximum17
Range17
Interquartile range (IQR)0

Descriptive statistics

Standard deviation3.210790149
Coefficient of variation (CV)0.3990998035
Kurtosis1.645508794
Mean8.045080755
Median Absolute Deviation (MAD)0
Skewness-1.485432434
Sum135986
Variance10.30917338
MonotonicityNot monotonic
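
For a variable with missing cells, the figures above only reconcile if the mean is taken over the non-missing observations while the Missing (%) and Zeros (%) rates are taken over all rows. The short check below uses the numbers reported for profissao; the variable names are illustrative.

```python
from math import isclose

# Figures copied from the report for profissao.
n_rows = 20000
n_missing = 3097
n_zeros = 1398
total = 135986
reported_mean = 8.045080755

# The mean divides the sum by the count of non-missing values only.
n_valid = n_rows - n_missing
mean = total / n_valid

# Missing and zero rates are expressed relative to all rows.
missing_pct = round(100 * n_missing / n_rows, 1)
zeros_pct = round(100 * n_zeros / n_rows, 1)

print(n_valid, mean, missing_pct, zeros_pct)
assert isclose(mean, reported_mean, rel_tol=1e-8)
```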
Histogram with fixed size bins (bins=18)

Value             Count  Frequency (%)
9                 12103  60.5%
0                 1398   7.0%
11                1349   6.7%
2                 1171   5.9%
12                192    1.0%
10                173    0.9%
16                126    0.6%
13                125    0.6%
7                 89     0.4%
8                 61     0.3%
Other values (8)  116    0.6%
(Missing)         3097   15.5%

Value  Count  Frequency (%)
0      1398   7.0%
1      1      < 0.1%
2      1171   5.9%
3      7      < 0.1%
4      13     0.1%

Value  Count  Frequency (%)
17     16     0.1%
16     126    0.6%
15     25     0.1%
14     6      < 0.1%
13     125    0.6%

ocupacao
Real number (ℝ≥0)

MISSING
ZEROS

Distinct6
Distinct (%)< 0.1%
Missing2978
Missing (%)14.9%
Infinite0
Infinite (%)0.0%
Mean2.533309834
Minimum0
Maximum5
Zeros1114
Zeros (%)5.6%
Memory size156.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q34
95-th percentile5
Maximum5
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.532765217
Coefficient of variation (CV)0.6050445137
Kurtosis-1.064418569
Mean2.533309834
Median Absolute Deviation (MAD)1
Skewness0.3443705658
Sum43122
Variance2.34936921
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)

Value      Count  Frequency (%)
2          6882   34.4%
1          3144   15.7%
4          2924   14.6%
5          2822   14.1%
0          1114   5.6%
3          136    0.7%
(Missing)  2978   14.9%

Value  Count  Frequency (%)
0      1114   5.6%
1      3144   15.7%
2      6882   34.4%
3      136    0.7%
4      2924   14.6%

Value  Count  Frequency (%)
5      2822   14.1%
4      2924   14.6%
3      136    0.7%
2      6882   34.4%
1      3144   15.7%

profissao_companheiro
Real number (ℝ≥0)

MISSING
ZEROS

Distinct14
Distinct (%)0.2%
Missing11514
Missing (%)57.6%
Infinite0
Infinite (%)0.0%
Mean3.708107471
Minimum0
Maximum17
Zeros5551
Zeros (%)27.8%
Memory size156.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q311
95-th percentile11
Maximum17
Range17
Interquartile range (IQR)11

Descriptive statistics

Standard deviation5.181240608
Coefficient of variation (CV)1.397273582
Kurtosis-1.355947696
Mean3.708107471
Median Absolute Deviation (MAD)0
Skewness0.7295114844
Sum31467
Variance26.84525424
MonotonicityNot monotonic
Histogram with fixed size bins (bins=14)

Value             Count  Frequency (%)
0                 5551   27.8%
11                2358   11.8%
9                 409    2.0%
16                78     0.4%
2                 39     0.2%
12                15     0.1%
10                13     0.1%
6                 9      < 0.1%
13                5      < 0.1%
17                3      < 0.1%
Other values (4)  6      < 0.1%
(Missing)         11514  57.6%

Value  Count  Frequency (%)
0      5551   27.8%
1      1      < 0.1%
2      39     0.2%
3      1      < 0.1%
6      9      < 0.1%

Value  Count  Frequency (%)
17     3      < 0.1%
16     78     0.4%
14     1      < 0.1%
13     5      < 0.1%
12     15     0.1%

grau_instrucao_companheiro
Real number (ℝ≥0)

MISSING
ZEROS

Distinct6
Distinct (%)0.1%
Missing12860
Missing (%)64.3%
Infinite0
Infinite (%)0.0%
Mean0.2880952381
Minimum0
Maximum5
Zeros6485
Zeros (%)32.4%
Memory size156.4 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.9443388652
Coefficient of variation (CV)3.277870441
Kurtosis8.751956221
Mean0.2880952381
Median Absolute Deviation (MAD)0
Skewness3.183826767
Sum2057
Variance0.8917758923
MonotonicityNot monotonic
Histogram with fixed size bins (bins=6)

Value      Count  Frequency (%)
0          6485   32.4%
3          245    1.2%
4          244    1.2%
2          132    0.7%
1          22     0.1%
5          12     0.1%
(Missing)  12860  64.3%

Value  Count  Frequency (%)
0      6485   32.4%
1      22     0.1%
2      132    0.7%
3      245    1.2%
4      244    1.2%

Value  Count  Frequency (%)
5      12     0.1%
4      244    1.2%
3      245    1.2%
2      132    0.7%
1      22     0.1%

local_onde_reside
Real number (ℝ≥0)

HIGH CORRELATION

Distinct743
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean581.29525
Minimum105
Maximum999
Zeros0
Zeros (%)0.0%
Memory size156.4 KiB

Quantile statistics

Minimum105
5-th percentile148
Q1444
median596
Q3728
95-th percentile956
Maximum999
Range894
Interquartile range (IQR)284

Descriptive statistics

Standard deviation227.369798
Coefficient of variation (CV)0.3911433957
Kurtosis-0.5758248011
Mean581.29525
Median Absolute Deviation (MAD)144
Skewness-0.2500355883
Sum11625905
Variance51697.02503
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
960                 367    1.8%
591                 345    1.7%
570                 310    1.6%
456                 256    1.3%
628                 249    1.2%
685                 222    1.1%
596                 205    1.0%
689                 196    1.0%
619                 194    1.0%
581                 189    0.9%
Other values (733)  17467  87.3%

Value  Count  Frequency (%)
105    1      < 0.1%
110    11     0.1%
112    1      < 0.1%
113    83     0.4%
114    46     0.2%

Value  Count  Frequency (%)
999    2      < 0.1%
998    1      < 0.1%
997    4      < 0.1%
996    2      < 0.1%
995    8      < 0.1%

local_onde_trabalha
Real number (ℝ≥0)

HIGH CORRELATION

Distinct743
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean581.29525
Minimum105
Maximum999
Zeros0
Zeros (%)0.0%
Memory size156.4 KiB

Quantile statistics

Minimum105
5-th percentile148
Q1444
median596
Q3728
95-th percentile956
Maximum999
Range894
Interquartile range (IQR)284

Descriptive statistics

Standard deviation227.369798
Coefficient of variation (CV)0.3911433957
Kurtosis-0.5758248011
Mean581.29525
Median Absolute Deviation (MAD)144
Skewness-0.2500355883
Sum11625905
Variance51697.02503
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)

Value               Count  Frequency (%)
960                 367    1.8%
591                 345    1.7%
570                 310    1.6%
456                 256    1.3%
628                 249    1.2%
685                 222    1.1%
596                 205    1.0%
689                 196    1.0%
619                 194    1.0%
581                 189    0.9%
Other values (733)  17467  87.3%

Value  Count  Frequency (%)
105    1      < 0.1%
110    11     0.1%
112    1      < 0.1%
113    83     0.4%
114    46     0.2%

Value  Count  Frequency (%)
999    2      < 0.1%
998    1      < 0.1%
997    4      < 0.1%
996    2      < 0.1%
995    8      < 0.1%

inadimplente
Categorical

HIGH CORRELATION
UNIFORM

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.4 KiB
0
10000 
1
10000 

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters20000
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row1
4th row1
5th row1
Value  Count  Frequency (%)
0      10000  50.0%
1      10000  50.0%

Histogram of lengths of the category

Value  Count  Frequency (%)
1      10000  50.0%
0      10000  50.0%

Most occurring characters

Value  Count  Frequency (%)
0      10000  50.0%
1      10000  50.0%

Most occurring categories

Value           Count  Frequency (%)
Decimal Number  20000  100.0%

Most frequent character per category

Value  Count  Frequency (%)
0      10000  50.0%
1      10000  50.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  20000  100.0%

Most frequent character per script

Value  Count  Frequency (%)
0      10000  50.0%
1      10000  50.0%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  20000  100.0%

Most frequent character per block

Value  Count  Frequency (%)
0      10000  50.0%
1      10000  50.0%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
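
The calculation just described can be sketched in a few lines of plain Python. The function name `pearson_r` and the toy data are illustrative assumptions, not part of the report.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Toy data (hypothetical, not taken from the dataset).
x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 4.0, 6.0, 8.0]

r = pearson_r(x, y)

# Shifting and rescaling y leaves r unchanged (location and scale invariance).
r_shifted = pearson_r(x, [10.0 + 0.5 * v for v in y])

# Reversing the direction of the relationship flips the sign.
r_negative = pearson_r(x, [-v for v in y])
print(r, r_shifted, r_negative)
```

The sketch divides by n throughout; using n - 1 instead gives the same r because the factor cancels. It does not guard against a constant input, for which the standard deviation is zero and r is undefined.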

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at capturing nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
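
A minimal sketch of this rank-based calculation follows, assuming tied values share the average of their ranks (a common convention; the report does not say which one it uses). The helper names and the toy data are illustrative.

```python
from math import sqrt

def average_ranks(values):
    """Rank values from 1 to n; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson correlation applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Toy data: y = x**3 is monotonic but not linear.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```

On this toy data ρ reaches 1 while r stays below 1, which is the difference between monotonic and linear association described above.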

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
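
The pair-counting definition can be sketched directly. This is the plain form (often called tau-a) with no tie correction; library implementations commonly apply a tie-corrected variant, so results can differ when the data contain ties. The function name and toy data are illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # A pair is concordant when x and y order the two observations the same way.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # sign == 0 is a tie and counts as neither.
    return (concordant - discordant) / len(pairs)

# Toy data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau(x, y)
print(tau)
```

Here five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.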

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
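
A sketch of the bias-corrected coefficient follows, using the correction as it is usually stated: the squared φ and the table dimensions are each adjusted by terms in 1/(n - 1). This is an illustration under that assumption, not the report's own code; the function name and toy data are hypothetical.

```python
from collections import Counter
from math import sqrt

def cramers_v_corrected(x, y):
    """Bias-corrected Cramér's V for two equally long sequences of labels."""
    n = len(x)
    row_totals = Counter(x)
    col_totals = Counter(y)
    observed = Counter(zip(x, y))

    # Pearson chi-squared statistic of the contingency table.
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            chi2 += (observed.get((a, b), 0) - expected) ** 2 / expected

    r, k = len(row_totals), len(col_totals)
    phi2 = chi2 / n
    # Bias correction: shrink phi2 and the effective table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, k_corr - 1)
    return sqrt(phi2_corr / denom) if denom > 0 else 0.0

# Toy data: identical labels (perfect association) and a balanced independent pair.
labels = ["a", "b"] * 50
v_perfect = cramers_v_corrected(labels, labels)
v_independent = cramers_v_corrected(["a", "a", "b", "b"] * 25, ["u", "v", "u", "v"] * 25)
print(v_perfect, v_independent)
```

Two columns with identical values, as qtde_contas_bancarias and qtde_contas_bancarias_especiais appear to be from their matching counts, would score at or near 1 under this measure.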

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
The dendrogram allows you to more fully correlate variable completion, revealing trends deeper than the pairwise ones visible in the correlation heatmap.
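
To make the idea of nullity correlation concrete, the sketch below marks each cell as present (1) or missing (0) and correlates those indicators between two columns. The rows are hypothetical toy records that reuse column names from this dataset, and the helper names are illustrative; the report's plots are produced by its own tooling.

```python
from math import sqrt

# Hypothetical toy rows; None marks a missing cell.
rows = [
    {"profissao": 9.0, "profissao_companheiro": 0.0, "grau_instrucao_companheiro": 0.0},
    {"profissao": 2.0, "profissao_companheiro": None, "grau_instrucao_companheiro": None},
    {"profissao": None, "profissao_companheiro": None, "grau_instrucao_companheiro": None},
    {"profissao": 9.0, "profissao_companheiro": 16.0, "grau_instrucao_companheiro": 4.0},
    {"profissao": 9.0, "profissao_companheiro": 11.0, "grau_instrucao_companheiro": None},
    {"profissao": None, "profissao_companheiro": 0.0, "grau_instrucao_companheiro": 0.0},
]

def present(column):
    """1 where the column has a value, 0 where it is missing."""
    return [0 if row[column] is None else 1 for row in rows]

def missing_count(column):
    return len(rows) - sum(present(column))

def nullity_correlation(col_a, col_b):
    """Pearson correlation between the presence indicators of two columns."""
    a, b = present(col_a), present(col_b)
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a fully present or fully missing column has no variation
    return cov / sqrt(var_a * var_b)

corr = nullity_correlation("profissao_companheiro", "grau_instrucao_companheiro")
print(missing_count("profissao"), corr)
```

A value near +1 means the two columns tend to be missing together, and a value near -1 means one tends to be present when the other is missing.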

Sample

First rows

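(index) | id_solicitante | produto_solicitado | dia_vencimento | forma_envio_solicitacao | tipo_endereco | sexo | idade | estado_civil | qtde_dependentes | grau_instrucao | nacionalidade | estado_onde_nasceu | estado_onde_reside | possui_telefone_residencial | codigo_area_telefone_residencial | tipo_residencia | meses_na_residencia | possui_telefone_celular | possui_email | renda_mensal_regular | renda_extra | possui_cartao_visa | possui_cartao_mastercard | possui_cartao_diners | possui_cartao_amex | possui_outros_cartoes | qtde_contas_bancarias | qtde_contas_bancarias_especiais | valor_patrimonio_pessoal | possui_carro | vinculo_formal_com_empresa | estado_onde_trabalha | possui_telefone_trabalho | codigo_area_telefone_trabalho | meses_no_trabalho | profissao | ocupacao | profissao_companheiro | grau_instrucao_companheiro | local_onde_reside | local_onde_trabalha | inadimplente
0 | 1 | 1 | 10 | presencial | 1 | M | 85 | 2 | 0 | 0 | 1 | CE | CE | Y | 107 | 1.0 | 12.0 | N | 0 | 480.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N |  | N |  | 0 | 9.0 | 1.0 | 0.0 | 0.0 | 600.0 | 600.0 | 0
1 | 2 | 1 | 25 | internet | 1 | F | 38 | 1 | 0 | 0 | 1 | SE | SE | Y | 91 | 1.0 | 5.0 | N | 1 | 380.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | 2.0 | 5.0 | NaN | NaN | 492.0 | 492.0 | 0
2 | 3 | 1 | 20 | internet | 1 | F | 37 | 2 | 0 | 0 | 1 | BA | BA | Y | 90 | 5.0 | 1.0 | N | 1 | 600.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | NaN | NaN | NaN | NaN | 450.0 | 450.0 | 1
3 | 4 | 1 | 20 | internet | 1 | M | 37 | 1 | 1 | 0 | 1 | RS | RS | Y | 54 | 1.0 | 1.0 | N | 1 | 460.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | Y | RS | Y | 54 | 0 | 9.0 | 2.0 | NaN | NaN | 932.0 | 932.0 | 1
4 | 5 | 7 | 1 | internet | 1 | F | 51 | 1 | 3 | 0 | 1 | BA | BA | Y | 86 | 0.0 | 1.0 | N | 1 | 687.0 | 600.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 1 | Y | BA | N |  | 0 | 9.0 | 5.0 | NaN | NaN | 440.0 | 440.0 | 1
5 | 6 | 1 | 20 | presencial | 1 | M | 21 | 1 | 1 | 0 | 1 | CE | CE | Y | 107 | 5.0 | 2.0 | N | 0 | 382.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 1 | Y | CE | Y | 107 | 0 | 9.0 | 2.0 | 0.0 | 0.0 | 628.0 | 628.0 | 1
6 | 7 | 1 | 15 | presencial | 1 | F | 64 | 4 | 2 | 0 | 1 | SP | SP | Y | 16 | 1.0 | 0.0 | N | 1 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N |  | N |  | 0 | 10.0 | 1.0 | 0.0 | 0.0 | 190.0 | 190.0 | 1
7 | 8 | 1 | 5 | internet | 1 | F | 20 | 1 | 0 | 0 | 1 | ES | ES | Y | 25 | 1.0 | 5.0 | N | 1 | 800.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | NaN | NaN | NaN | NaN | 299.0 | 299.0 | 1
8 | 9 | 2 | 25 | internet | 1 | F | 39 | 2 | 2 | 0 | 1 | GO | GO | Y | 67 | 1.0 | 3.0 | N | 1 | 1200.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | Y |  | Y | 69 | 0 | 9.0 | 2.0 | 9.0 | 4.0 | 756.0 | 756.0 | 0
9 | 10 | 1 | 10 | presencial | 1 | M | 44 | 2 | 2 | 0 | 1 | RS | RS | N |  | 1.0 | 15.0 | N | 0 | 749.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | Y | RS | N |  | 0 | 9.0 | 2.0 | 16.0 | 4.0 | 960.0 | 960.0 | 1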

Last rows

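(index) | id_solicitante | produto_solicitado | dia_vencimento | forma_envio_solicitacao | tipo_endereco | sexo | idade | estado_civil | qtde_dependentes | grau_instrucao | nacionalidade | estado_onde_nasceu | estado_onde_reside | possui_telefone_residencial | codigo_area_telefone_residencial | tipo_residencia | meses_na_residencia | possui_telefone_celular | possui_email | renda_mensal_regular | renda_extra | possui_cartao_visa | possui_cartao_mastercard | possui_cartao_diners | possui_cartao_amex | possui_outros_cartoes | qtde_contas_bancarias | qtde_contas_bancarias_especiais | valor_patrimonio_pessoal | possui_carro | vinculo_formal_com_empresa | estado_onde_trabalha | possui_telefone_trabalho | codigo_area_telefone_trabalho | meses_no_trabalho | profissao | ocupacao | profissao_companheiro | grau_instrucao_companheiro | local_onde_reside | local_onde_trabalha | inadimplente
19990 | 19991 | 1 | 10 | presencial | 1 | F | 52 | 4 | 0 | 0 | 1 | SP | PR | N |  | 1.0 | 0.0 | N | 0 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N |  | N |  | 0 | 0.0 | 1.0 | 0.0 | 0.0 | 872.0 | 872.0 | 1
19991 | 19992 | 1 | 10 | presencial | 1 | M | 48 | 2 | 2 | 0 | 1 | MG | MG | N |  | 1.0 | 6.0 | N | 0 | 1308.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N |  | N |  | 0 | 0.0 | 1.0 | 0.0 | 0.0 | 351.0 | 351.0 | 1
19992 | 19993 | 1 | 5 | internet | 1 | M | 62 | 4 | 0 | 0 | 1 | ES | RJ | Y | 20 | 1.0 | 30.0 | N | 1 | 358.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | 9.0 | 1.0 | NaN | NaN | 230.0 | 230.0 | 0
19993 | 19994 | 1 | 5 | internet | 1 | F | 18 | 1 | 0 | 0 | 1 | RJ | RJ | Y | 22 | 1.0 | 6.0 | N | 1 | 405.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | 9.0 | 2.0 | NaN | NaN | 289.0 | 289.0 | 0
19994 | 19995 | 2 | 5 | presencial | 1 | M | 23 | 2 | 0 | 0 | 1 | BA | BA | Y | 84 | 1.0 | 23.0 | N | 0 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N |  | N |  | 0 | 0.0 | 0.0 | 0.0 | 0.0 | 457.0 | 457.0 | 1
19995 | 19996 | 1 | 10 | presencial | 1 | M | 27 | 2 | 0 | 0 | 1 | MG | MG | Y | 29 | 2.0 | 0.0 | N | 1 | 423.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | Y |  | N |  | 0 | 9.0 | 1.0 | 0.0 | 0.0 | 308.0 | 308.0 | 0
19996 | 19997 | 1 | 20 | presencial | 1 | F | 26 | 2 | 1 | 0 | 1 | CE | CE | Y | 107 | 1.0 | 3.0 | N | 0 | 350.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | Y |  | N |  | 0 | 9.0 | 2.0 | 0.0 | 0.0 | 639.0 | 639.0 | 0
19997 | 19998 | 1 | 10 | internet | 1 | F | 63 | 2 | 0 | 0 | 1 | BA | BA | Y | 86 | 5.0 | 25.0 | N | 1 | 321.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | 9.0 | 1.0 | NaN | NaN | 486.0 | 486.0 | 0
19998 | 19999 | 1 | 5 | internet | 1 | F | 84 | 1 | 0 | 0 | 1 | PB | RN | N |  | 1.0 | 30.0 | N | 1 | 380.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.0 | 0 | N |  | N |  | 0 | NaN | NaN | NaN | NaN | 590.0 | 590.0 | 0
19999 | 20000 | 2 | 20 | presencial | 1 | F | 53 | 1 | 0 | 0 | 1 | MA | SP | Y | 5 | 1.0 | 11.0 | N | 1 | 300.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0.0 | 1 | N |  | N |  | 0 | 9.0 | 5.0 | 0.0 | 0.0 | 132.0 | 132.0 | 0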